Can Automatic Personal Categorization deal with User Inconsistency?

نویسندگان

  • Dina Goren-Bar
  • Tsvi Kuflik
چکیده

Document categorization is a daily task in every organization, but it is a very subjective process. While automatic document categorization has been widely studied, much challenging research still remains to support user subjective categorization. This study evaluates and compares the application of Self-Organizing Maps (SOM) and Learning Vector Quantization (LVQ) to automatic document classification according to a subjectively predefined set of clusters in a specific domain, and assesses the effect of user inconsistency on this process. Results show that despite the subjective and inconsistent nature of human categorization, automatic document clustering methods correlate well with subjective, personal clustering. Moreover, adapting a system to its users is limited by users' inconsistency, meaning that a perfect adaptation is an impractical goal.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Can Automatic Personal Categorization deal with User Inconsistency?

Document categorization is a daily task in every organization, but it is a very subjective process. While automatic document categorization has been widely studied, much challenging research still remains, to support user subjective categorization. This study evaluates and compares the application of Self-Organizing Maps (SOM) and Learning Vector Quantization (LVQ) to automatic document classif...

متن کامل

Automating Personal Categorization Using Artificial Neural Networks

Organizations as well as personal users invest a great deal of time in assigning documents they read or write to categories. Automatic document classification that matches user subjective classification is widely used, but much challenging research still remain to be done. The self-organizing map (SOM) is an artificial neural network (ANN) that is mathematically characterized by transforming hi...

متن کامل

Supporting user-subjective categorization with self-organizing maps and learning vector quantization

we requested the user to reclassify documents that were misclassified by the system. Results show that despite the subjective nature of human categorization, automatic document categorization methods correlate well with subjective, personal categorization, and the LVQ method outperforms the SOM. The reclassification process revealed an interesting pattern: About 40% of the documents were classi...

متن کامل

A Tool for Individualizing the Web

The increasing complexity of navigating the Internet is becoming one of the fundamental obstacles to its eeective use. This is due to the nature of the Internet, principally, a disorganized collection of both sites and site documents whose exponential growth rate rapidly is outstripping any user's ability to master it. There are two ways to deal with this complexity: reorganize the structure of...

متن کامل

An Interface to Retrieve Personal Memories Using an Iconic Visual Language

Abstract: Relevant past events can be remembered when visualizing related pictures. The main difficulty is how to find these photos in a large personal collection. Query definition and image annotation are key issues to overcome this problem. The former is relevant due to the diversity of the clues provided by our memory when recovering a past moment and the later because images need to be anno...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002